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270 1 TTATTATCCAGAACATCTA GTTAATCATTACT TCCAAACTAGA CACTATTTACAC ACTCT 

HNF1 HNF3 

276 1 ATGGAAGGCGGGTATA TTATATAA GAGAGA A AGA AC AC AT A ccciCCTC ATTTrcrnnr^Tr^ 
Spl TBP RNA Start 

282 1 ACCATATTCTTGGGAACAAGATCTACAGC ATGGGGC 

PreS 1 protein start 



Fig. IB 
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6500 -- 



-O— AGACACTATTTACACACTC/HNF3-wt 

GGTATATTATATAAGAGAG/TBP-wt 
-A— GGAATATGCGCGCCGAGAG/TBP-mut 
-■— CTAGTTAATCATTACTTCC/HN F1 -wt 
-O— CTATCGCCGACGGCAGTCC/HNF1 -m 
-•— " CTAGTTAATAATTAATTCC/HNF1 -21 X 
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Fig. 2 



1081 CTA AGC AGG CTT TCA CTT T CT CGC CAA CTT ACA AGG CCT TTC TGT GTA AAC AAT 

NF1 (1100-1119) 

2c (1119-1134) 

1135 ACC TGA ACC TTT A CC CCG TTG CCC GGC AAC GGC CA G GTC TGT GCC AAG TGT TTG 

EF-C (1148-1168) 

1189 CTG ACG CAA CCC CCA CTG GCT GGG GCT TGG TCA TGG GCC ATC AGC GCA TGC GTG 

E(1180-1202) NF1 (1209-1236) X-PBP (1229-1245) 

1243 GAA CCT TTT CGG CTC CTC TGC CGA TCC ATA CTG CGG AAC TCC TAG CCG CTT GTT 



12 97 TTG CTC GCA GCA GGT CTG GAG CAA ACA TTA TCG GGA CTG ATA ACT CTG TTG TCC 



1351 TAT CCC GCA AAT ATA CAT CGT TTC CAT GGC TGC TAG 138 6 



Fig. 3 



CAGCTGGG CCGCCCTTGT 

GCGGGGGCCA ACGCGATTGT GGGTGCTCGG 

CCCTGCTGGG GCAACCCATC GCTCCCCATG 

CCCGGAATAT TAGTAATCCT AATTCCCGGC 

GAAAGGTGGG GGTGGGGGGG GTCGCATCTT 

CTTTTTCTAT CAGTTTTCTT TGAGCTTTTA 

CTGAACTATA TTCAAAAGGA AGTAAATGAA 

AAAATAAAAA CAAAAGTTAA GACAGTAAAA 

ACAGAACCTG TAATTTTAAA AACTGTGTAT 

ATATTGGGGA CCCTCTCATG TAACCACGAA 

GTACACTCGT TTGTTTAATT GATAATTGTT 

CACGCTCACG AATTCAGTCC CAGGGCAAAT 

ACAAAACCAA TTAGGAACTT CGGTGGTCTT 

CAATTTAATT TCTTTTTTAA TTAAAAAAAA 

CTTTCCATTC AGAGGTGTGT TTCTCCCGGT 

CAGTTGGGGA CCCCCGCAAG GACCGACTGG 

CAGGCTAGAA GGACAAGATG AAGGAAATGC 

TTCGGGCATT TATTTTATTT TATTTTTTGA 

ACTTTTAGGG TTACCCCCTT GGGCATTTGC 

TGCACAGGGG TTGTGTGCCC GGTCCTCCCC 

TTACACGTGT TAATGAAAAT GAAAGAAGAT 

CGCCCGTGGG TGCCCTCGTG GCGTTCTTGG 

GGGTGTCGCC GCGCCCCAGT CACCCCTTCT 

GCCTTCCTAG TTGTCCCCTA CTGCAGAGCC 

ACCCACTCGA GGCGGACGGG GCCCCCTGCA 

AGCGGGGCGA TTTGCATTTC TATGAAAACC 

GGCGCGGCGC CTCAGGGATG GCTTTTGGGC 

CCCGCGCCCC CTCCCCCTGC GCCCGCCCCC 

CTTTGATCTT TGCTTAACAA CAGTAACGTC 

TGCAAAGTCC TGGAGCCTCC AGAGGGCTGT 

CACGCTCCGG CGAGGGGCAG AAGAGCGCGA 

AGCGCGGACC CAGCCAGGAC CCACAGCCCT 



GCGCGGGCTG ATGCTCTGAG GCTTGGCTAT 
GGAGTGGGGG GGGGCACGAC CGTAGGTGCT 
CGGAATCCGG GGGTAATTAC CCCCCCAGGA 
GGGGGAGGGG GCGCGGGAGG AATTCACCCT 
GCTGTGAGCA CCCTGGCGAA GGGGAGAGGG 
CTGTTAAGAG GGTACGGTGG TTTGATGACA 
CAGTTTTCTT AATTTGGGGC AGGTACTGTA 
TGTCCTTTTA TTTTTTAATG CACCAAAGAG 
TTTAATTTAC ATCTGCTTAA GTTTGCGATA 
CACCTATCGA TTTTGCTAAA AATCAGATCA 
CTGAATTATG CCGGCTCCTG CCAGCCCCCT 
TCTAAAGGTG AAGGGACGTC TACACCCCCA 
GTCCCAGGCA GAGGGGACTA ATATTTCCAG 
TGAGTCAGAA TGGAGATCAC TGTTTCTCAG 
TAAATTGCCG GCACGGGAAG GGAGGGGGTG 
TCAAGGTAGG AAGGCAGCCC GAAGAGTCTC 
TGGCCACCAT CTTGGGCTGC TGCTGGAATT 
GCGAGCGCAT GCTAAGCTGA AATCCCTTTA 
AACGACGCCC CTGTGCGCCG GAATGAAACT 
GTCCTTGCAT GCTAAATTAG TTCTTGCAAT 
GCAGTCGCTG AGATTCTTTG GCCGTCTGTC 
AAATGCGCCC ATTCTGCCGG CTTGGATATG 
CGTGGTCTCC CCAGGCTGCG TGCTGTGCCG 
ACCTCCACCT CACCCCCTAA ATCCCGGGGG 
CCCCTCTTCC CTGGCGGGGA GAAAGGCTGC 
GGACTACAGG GGCAACTCCG CCGCAGGGCA 
TCTGCCCCTC GCTGCTCCCG GCGTTTGGCG 
GCCCCCCTCC CGCTCCCATT CTCTGCCGGG 
ACACGGACTA CAGGGGAGTT TTGTTGAAGT 
CGGCGCAGTA GCAGCGAGCA GCAGAGTCCG 
GGGAGCGCGG GGCAGCAGAA GCGAGAGCCG 
CCCCAGCTGC CCAGGAAGAG CCCCA 



Fig. 4 



10 20 30 40 50 60 70 

GAATTCACTG GGGAGAGCAT TCAGGAAGAT GACAACAGGA TAATAGGTCA ACAGAGTAAT AGAGAGGTCG 
CTTAAGTGAC CCCTCTCGTA AGTCCTTCTA CTGTTGTCCT AT TATCCAGT TGTCTCATTA TCTCTCCAGC 

80 90 100 110 120 130 140 

CTAAAAATAA ACT CTAAGAA GTATTCAGCC AAAACTATTA TTGAGCTAAT AATGGTGGGA TCAATTTCAG 
GATTTTTATT TGAGATTCTT CATAAGTCGG TTTTGATAAT AACTCGATTA TTACCACCCT AGTTAAAGTC 

150 160 170 180 190 200 210 

GGGAATATTG TGGGCAGAAG TCAGACTGTA GGAGGCTGGG GATCAAGAAG TTGAGGCAAG GAGGTTGGAC 
CCCTTATAAC ACCCGTCTTC AGTCTGACAT CCTCCGACCC CTAGTTCTTC AACTCCGTTC CTCCAACCTG 

220 230 240 250 260 270 280 

AACAACTGTT TTTTCAAGTT GGTCACGTGA ACAAATCTGT GACCTTCAGC CTCCCCTCCC TCGGGTCTTG 
TTGTTGACAA AAAAGTTCAA CCAGTGCACT TGTTTAGACA CTGGAAGTCG GAGGGGAGGG AGCCCAGAAC 

290 300 310 320 330 340 - 350 

GCTGAGCTGA TTGCAGGGCC CCTGCAGCTC TGGCACTCTC AAGTTGTATA AAACTGACAG TGCAGAAGTC 
CGACTCGACT AACGTCCCGG GGACGTCGAG ACCGTGAGAG T T C AAC AT AT TTTGACTGTC ACGTCTTCAG 

360 370 380 390 400 410 420 

CTTGAGCCCA TTTTGGCTCT CATGATAATT TTCCTTCAGT GGAACTAAGG TTACTTGTCT AAGAAC C AAA 
GAACTCGGGT AAAACCGAGA GTACTATTAA AAGGAAGTCA CCTTGATTCC AATGAACAGA TTCTTGGTTT 

430 440 450 460 470 480 490 

GCCTCTGACT TGACTGATCA AAGTTCATCA CGTGCATCGA AGCCACCTAC TTGGCAGATG TAGTGAAAAG 
CGGAGACTGA ACTGACTAGT TTCAAGTAGT GCACGTAGCT TCGGTGGATG AACCGTCTAC ATCACTTTTC 

500 510 520 530 540 550 560 

CT AC ATA GAT CTGGGCCCAG GACAGGATGC TGGGGCGTGG GAGGGGAAGA AAGCAGGTGC TAACTATATA 
GATGTATCTA GACCCGGGTC CTGTCCTACG ACCCCGCACC CTCCCCTTCT TTCGTCCACG ATTGATATAT 

570 580 590 600 610 620 630 

GATAGCATGC CTATCAGAGC AGTTTTTACG TTTCCTATTT GTCTCTCAAA ACAATTTTAT AGGAATCATC 
CTATCGTACG GATAGTCTCG TCAAAAATGC AAAGGATAAA CAGAGAGTTT TGTTAAAATA TCCTTAGTAG 

640 650 660 670 680 690 700 

AAAGCAATTT TATCATGGTT TCTAGACCAG GTTTGGATGT GAGGTAGGGA TTTCCACAGC TGCTTTTAGT 
TTTCGTTAAA ATAGTACCAA AGATCTGGTC CAAACCTACA CTCCATCCCT AAAGGTGTCG ACGAAAATCA 

710 720 730 740 750 760 770 

TTGAAGGAAA TCTGATAAGA TGATGCAAAA GCCCTTCAGA AATGTGTAAT CCTACACACT TCAGTGATTC 
AACTTCCTTT AGACTATTCT ACTACGTTTT CGGGAAGTCT TTACACATTA GGATGTGTGA AGTCACTAAG 

780 7 90 800 810 820 830 840 

AATTCATTGT CAAAACTTAA GGTGTTTTTA ATATTGTTAT TGTTCATTTG GTTTTTACCA ACATGTAAGG 
TTAAGTAACA GTTTTGAATT CCACAAAAAT TATAACAATA ACAAGTAAAC CAAAAAT GGT TGTACATTCC 

850 860 870 880 890 900 910 

AGTTGGCAAT TATTTGTTAA ACTCATGTCT TAGGCTAAAT AAATTCCAAA AAATTCAGGA TGAGAATTGT 
TCAACCGTTA ATAAACAATT TGAGTACAGA ATCCGATTTA TTTAAGGTTT TTTAAGTCCT ACTCTTAACA 



Fig . 5A 



920 930 940 950 960 970 980 

TTATTGCTTA ACGTGTGTCA AATTTCTTCC ATGCACATCT TTATTAGATC TTCACAGCAA CCTACAGGAT 
AATAACGAAT TGCACACAGT TTAAAGAAGG TACGTGTAGA AATAATCTAG AAGTGTCGTT GGATGTCCTA 



990 1000 1010 1020 1030 1040 1050 

AAGCAAGACA GGTGCAAGTG CCTCCTTTGG GTATGAGGAA ACTGAGGTCT AAAGAGATGA AGTGATTTGC 
TTCGTTCTGT CCACGTTCAC GGAGGAAACC CATACTCCTT TGACTCCAGA TTTCTCTACT TCACTAAACG 



1060 1070 1080 1090 1100 1110 1120 

CCAAGGCTCA TAGCAATTTA TTGGTAGAGC AAAGACTAGA ATTCTCTTAA CTGCAGCCTA TTTTCCCTAT 
GGTTCCGAGT ATCGTTAAAT AACCATCTCG TTTCTGATCT TAAGAGAATT GACGTCGGAT AAAAGGGATA 



1130 1140 1150 1160 1170 1180 1190 

TCTGAACTGT TACATCAGCA TCAACAATTA TCTAATGGAT TGGAACAGTG TACACAGGCA GCTTAGCTAC 
AGACTTGACA ATGTAGTCGT AGTTGTTAAT AGATTACCTA ACCTTGTCAC ATGTGTCCGT CGAATCGATG 



1200 1210 1220 ' 1230 1240 1250 1260 

GTCAAGTCAC GATTTTTACT TTAACTTCAA TTCCAGAGTC TTGGCCTGAT TTCCCTCAAG ACCCTACTTA 
CAGTTCAGTG CTAAAAATGA AATTGAAGTT AAGGT CTCAG AACCGGACTA AAGGGAGTTC TGGGATGAAT 



1270 1280 1290 1300 1310 1320 1330 

TCTTTGGCTT TGGAAAATTT ATTTTTCTTG CATTATCTTT CCAGCTAAAT TTTATTTAAT AACCATCAGC 
AGAAACCGAA ACCTTTTAAA TAAAAAGAAC GTAATAGAAA GGTCGATTTA AAATAAATTA TTGGTAGTCG 



1340 1350 1360 1370 1380 1390 1400 

ATGCTTTTTT TGCTTTATGC CAT G TAG ACT TGACCTGAAA ACCTGCCAGG CTTTCATTGA GTTTAGTGAT 
TACGAAAAAA ACGAAATACG GTACATCTGA ACTGGACTTT TGGACGGTCC GAAAGTAACT CAAATCACTA 



1410 1420 1430 1440 1450 1460 1470 

TAAAGAAGTA AAGTTCTGAG AAGCAATTAG TTGATGGGAC ACCAGTCATA AAATCAATCC AAACTTTTGT 
ATTTCTTCAT TTCAAGACTC TTCGTTAATC AACTACCCTG TGGTCAGTAT TTTAGTTAGG TTTGAAAACA 

1480 1490 1500 1510 1520 1530 1540 

TGACATGTGT TTCTTTCTCC ATATACCAGG TTCCCGCTTC GTATTAGTAA GATTGAAATT GAAATAAGTC 
ACT GT ACACA AAG AAA GAG G TATATGGTCC AAGGGCGAAG CATAATCATT CTAACTTTAA CTTTATTCAG 



1550 1560 1570 1580 1590 1600 1610 

TATTGCTGGT GGATGAATTT GTCACTTTCC TTGAAACTGG TGAACCCAAA AAGTTAGACA GTGATAGGAA 
ATAACGACCA CCTACTTAAA CAGT GAAAGG AACTTTGACC ACTTGGGTTT TTCAATCTGT CACTATCCTT 



1620 1630 1640 1650 1660 1670 1680 

AATACTGCCA TTGTCTGTTA AGAAGTCTAT GACATTTCAA GGCAAGAATG AATATATGGA AGAAGAAACT 
TTATGACGGT AACAGACAAT TCTTCAGATA CTGTAAAGTT CCGTTCTTAC TTATATACCT TCTTCTTTGA 



1690 1700 1710 1720 1730 1740 1750 

TGTTTCTTCT TTACTTACAA AAAGGAAAGC CTGGAAGTGA ATGATATGGG TAT AAT T AAA AAAAAAAAAA 
ACAAAGAAGA AATGAATGTT TTTCCTTTCG GACCTTCACT TACTATACCC ATATTAATTT TTTTTTTTTT 



1760 1770 1780 1790 1800 1810 1820 

AAAACAAAAA ACCTTTACGT AACGTTTTGC TGGGAGAGAA G AC T AC GAAG CACATTTTCC AGGAAGT GT G 
TTTTGTTTTT TGGAAATGCA TTGCAAAACG ACCCTCTCTT CTGATGCTTC GT GTAAAAGG TCCTTCACAC 



Fig . 5B 



1830 
GGCTGCAACG 
CCGACGTTGC 

1900 
CTCTGCCACC 
GAGACGGTGG 

1970 
CTTCTCCCCG 
GAAGAGGGGC 

2040 
TCTTATCACC 
AGAATAGTGG 

2110 
TGAAC CACAA 
ACTTGGTGTT 

2180 
GGATGATGGT 
CCTACTACCA 

2250 
CATATTCTTG 
GTATAAGAAC 

2320 
CCATACCATC 
GGTATGGTAG 

2390 
GTCCATTTCA 
CAGGTAAAGT 



1840 1850 1860 1870 1880 1890 

ATTGTGCGCT CTTAACTAAT CCTGAGTAAG GTGGCCACTT TGACAGTCTT CTCATGCTGC 

TAACACGCGA GAATTGATTA GGACTCATTC CACCGGTGAA ACTGTCAGAA GAGTACGACG 

1910 1920 1930 1940 1950 1960 

TTCTCTGCCA GAAGATACCA TTTCAACTTT AACACAGCAT GATCGAAACA TACAACCAAA 
AAGAGACGGT CTTCTATGGT AAAGTTGAAA TTGTGTCGTA CTAGCTTTGT ATGTTGGTTT 

1980 1990 2000 2010 2020 2030 

ATCTGCGGCC ACTGGACTGC CCATCAGCAT GAAAATTTTT ATGTATTTAC TTACTGTTTT 
TAGACGCCGG TGACCTGACG GGTAGTCGTA CTTTTAAAAA TACATAAATG AATGACAAAA 

2050 2060 2070 2080 2090 2100 

CAGATGATTG GGTCAGCACT TTTTGCTGTG TATCTTCATA GAAGGCTGGA CAAGGTAAGA 

GTCTACTAAC CCAGTCGTGA AAAACGACAC ATAGAAGTAT CTTCCGACCT GTTCCATTCT 

2120 2130 2140 2150 2160 2170 

GCCTTTATTA ACTAAATTTG GGGTCCTTAC T AAT T CAT AG GTTGGTTCTA CCCAAATGAT 

CGGAAATAAT TGATTTAAAC CCCAGGAATG AT T AAGT AT C CAACCAAGAT GGGTTTACTA 

2190 2200 2210 2220 2230 2240 

AGAAAC C AAA TAGAAGAATG GTCTTGTGGC ATAATGTTTG TTCCCTAGTC AATGAACTCT 

TCTTTGGTTT ATCTTCTTAC C AG AAC AC C G TATTACAAAC AAGGGATCAG TTACTTGAGA 

2260 2270 2280 2290 2300 2310 

TCTCTGGTTA GGATCTTGGG ATCTGGAGTC AGACTGCCTG GGCTCAAATC TTGGCTCTGC 

AGAGAC CAAT CCTAGAACCC TAGACCTCAG TCTGACGGAC CCGAGTTTAG AACCGAGACG 

2330 2340 2350 2360 2370 2380 

TCTGTTATCC TGGGGCAAGT GCCTCAGTTT CCACATCTGA GAAATGGGGA TGGTAGTGGT 

AGACAATAGG ACCCCGTTCA C GGAGT C AAA GGTGTAGACT CTTTACCCCT ACCATCACCA 



TAG AT 
ATCTA 



Fig . 5C 



GAGATGTATATAATTTTTTAGGAAAATCTCAAGGTTATCTTTACTTTTTCTTA 
GGAAATTAACAATTTAATATTAAGAAACGGCTCGTTCTTACACGGTAGACTTA 
ATACCGTAAGAACGAGCCGTTTTCGTTCTTCAGAGAAAGATTTGACAAGATTA 

CCATTGGCATCCCCGTTTTATTTGGTGCCTTTCACAGAAAGGGTTGGTCTTAA 
TT 



Fig. 6 
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TCTAGAAAAT AATTCCCAAT ATTGAATCCC 
AAGATAAGTT GAATTTGGTC ATGAAGGAAG 
ACAGCAAGGT CCAAAGACAG TCAGGTCTTC 
AGATGTCTGA TGTGAACTAA AAAGATTGGT 
GCTGTAAAGA AAAATTGAAA CTTGATTGTG 
GGACATGAAT CTGCAGGCCA GTGTTTGCAG 
CACAGACATG TGTTGTTTGG CCTGCAGAGG 
GAATTAGTAG CCATCATTTT CAAAAATCAA 
TGTTCTTGAA AATAATGAAA CATCTGGAAC 
CAGTTGGAGC TGCGTAGTGA CTGCCCACTT 
CAGCTGCCAC CACTTCCCTT TATCTCTCTA 
CTTAAGAGAT ATTTGGGTTT GAAACCTCTG 
GACAGATGCT ATCAGAGTTG TCTTTTAAGA 
ATAGCCTGGA AGAGGGCAGC CAGGGGAGAA 
ATGAGATGAT GGCTTCAACA TTGAGGACAG 
ATAAGCATCT ATGCAAAGGA AATAGCAGAT 
AAAGTTTATT CATAAGCCCC TCTTTTCATC 
TTGTTCTTCA CTCTCCTTCC TGGAT CATGA 
TCCTATTCAT CAGTGAATTC CAAGTGCCTA 
ACAAATATTC GTTCAGTGCA TGAATTGGAG 
AAAGGTTTAT TACATACCTA AATATGCTGA 
CCAAAACTCC CAGTTTGGAA ACAGAATTTT 
TTCAAAAAAT AT TT ATT AAA CAATGACCAT 
ATGATGAACA AAAACCAACA TGATTCCTGC 
ACAGATTGTG AGAAGTCCTC CAACAAATAC 
GTGGGTGAGG GACTTGAGCT AGCCTAGGTG 
TGCATAGGCA TTAACTAGAT AAAT A GAG AG 
ACAGAATATG CAAAGGCAGG AAGGATTATG 
GTGTAAGTGG ATCTCAGAAA ACAGGAGGAA 
TCATGTGGGG TATGGTTAAT GTTTTGGACT 
CAGGACTTAA ACGGGGAAAT AATATGACCA 
AATATATATT GAGAGTTAAT TAT T GAC AAA 
AGTATTTTCA AAAAATGGTG CCAAAACAAT 
TCTACAAAAC TTAATTCAAA ATGGATCACA 
CTTCTGGAAG GAAAACACAG ATGGGAATCT 
ATATCTGACA CCATAATCCG TAAGTTAACA 
GACTTCATCA GAATTTAAAA TGTTTGTGCT 
AACTACAGAT AAGATATTGT TGAATCATAT 
ATAACTCTAA ACCCCCATAA TAAATTACAA 
AAATTTACAG TCTTCATCAA AGAAAGTATC 
TGCATCTTTA TTCATGGGGG GAT GAAAT AA 
AATACTAAAA TTAAAAAGAC TGAC CATACC 
TACAATCAAC TTAGGTAGAT GATTTGGAAG 
TGTGACTCAC CCATTCCATG GCTAAGTATT 
ACAAAGATGT T TAT AC AAAT ATTTATAGCA 
GAACCCAAAT GTCCATCAAA AGTGAATGGA 
AATACTACTT AGCAATAAAG AAGAATGAGC 
TCAAAGGCAT TGTGCTCAGT GAAAGATGCA 
AGTTCCATTG ACATAAAACT CTGGAAAGTG 
CATTGGTTGC CTGAGGAGGA GGAGTATAGG 
CAATAAAAAC TTTTGGAATC ATAGATATAT 



AAAGAATTCA ACATTTGGGC TGTCGTTTGA 61 
AGAGGGGGGA TACAATTTCA GTAAAAGGTA 121 
AGTAGTATGG AGTATATTCA GAGGGAGCCA 181 
GGTTGGTAGG AGGAAGAGGT GTGAGAAGAG 2 41 
ATGGACTTTA AAGGCTAGGC TATGGGACTT 301 
ACTGGCGCCC ATAACTGTCT AT CACAGC AA 3 61 
TTTGGCCTGC AT GAT GAT TT TAAACCATCT 421 
GAGATGCCAC ATTAAAATAT GGAATGCTGC 481 
ATTGAGGCCA CATTCCTGAC T GAC AGCAAT 541 
TACATGGGGC ATCTGATCCC TAGTCGATTA 601 
ATACCAAGCT CTTTTCACTC ATTTTTGTTA 661 
ATGCAGGTAA TTGAGGGTTA TAGAGCAGAG 721 
AAGAACCCTC TGTTCTTCAT TTTGTTGAAG 781 
GTTAGGGCTG GAGCTATGAG AAAGCATAAG 841 
AAAGAATATT GAGAT GAGAA AGTAGTCCAT 901 
GTCCTCAAAT CAGCAGAGGC AACAACTCTG 961 
TCCAATCCAG TTCAAATGTA AT TAT TT AAA 1021 
ATGAGCTCCT TAAATGCAGG GTCCACAGTG 1081 
GCACAGAGCC TGGCAAATAG TAAATGCTTA 1141 
TGATTCTCTA CTTTGCCTCA TAAGTTGAAA 12 01 
AATCACAGGG CATTTGGCAA CCCCCCAAAA 12 61 
AATTCTGTGA AAATAAAATC CATTCATTTA 1321 
GTCCACACCA GGCTGAGTCC TAAGGATTCA 1381 
TCTTAGGAAA CATACAGTTC AGTGAGGAAA 14 41 
TGGGTGCTAT TAAAATATAT TAAAAGGTGA 1501 
GTTCAGGAAG TCTTCCTGGA TGTGCTGATA 15 61 
AAGGATGAAC CAACATT GCA GGTAGAGGGA 1621 
GAGTCGTTGG AGGACCTGAA TAAAGGCCCA 1681 
AGGTGTATGA GATGAGATCA GAGAGGCAGA 1741 
TTTCTATTAA GAGCAATGGG GAGACAGTGA 1801 
GATTAAACTT TCTAAAAAAC CCTCTATGCA 18 61 
GATTCAAAGG CAACAAAGTG GAGAGAGAAT 1921 
AGGACATCTA TAT T AAAAGT TGGGTATCTG 1981 
GAC CT AAAT G TAAAACTGAA AGCTATACAA 2041 
GTGTGATCTT GAGTTTGAAA ATGATTTATT 2101 
TAATTCATAA GTGAACAAAG TGATGAACTG 2161 
TCAAAAGACA CTGGTATGAT AAT GAAGAC A 2221 
TTCTGATAAA GGAATTGTGG CTCAGAATAC 2281 
GTAGCCCAAT TAAAAAAAAA AAAAGAGAAA 2341 
AATTGTAAAA TAAGCACATG AAAAATGCTC 2401 
AAATTAAATG GGAAAGACAC CTCTAATTAG 2 4 61 
AAGTATTGGT GAAGTGGAAA TGTAAAATGA 2521 
TTTCTTACAA AAGTAGGTGT ATACCTACCC 25 81 
TACCTGAGAG AAAT GAAAGA ATACATCCAT 2 641 
GTTTTATTTG TAGTAGCCCC AAACTGAAAA 2701 
TAAACAAAGC GTGGTACAGC AAT GCAAT AG 27 61 
T AGT GAT AT A CATAACAGCT TAAATGTACA 2821 
AGTAAAAAAA AAAAAGAG T A CATGCTGTAT 28 81 
AAAAACAGTC TATACTGACA GAAAGCAGAT 2 941 
AGAGGTGGAG GGAAAATGTA CAAAGTGGCA 30 01 
TCACTATCTT GAT T GAG T GA TGATTTCATG 30 61 
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AGTGCACGTG CGTGTGTCAA AAATGATCAA 
TGTATATATC AATTATACCT CAGTACGGCT 
CAGAACTGAT TCTAGGAAAG AGT GGAGGGA 
GAGAACGGTG CCTTCTGGAA GTGAACTAGG 
AGATGTGTGA GATACTTAGG AAATAAAACC 
ATCCTGGCTT TCCAGAGCAA AGAGCTAGGG 
TGCAGAGGAA GCTTATAGCA GAAACTTACT 
TGGTTCCATT TGATTACTAA AAAACTGCAA 
GTACAAACAG TGAACTTATA TAATATTAAT 
GGGAAGAAAA TGAGATTTGA GCTGGAAGAT 
CCCTAATATG CTTATAGTAG TTAACCTTTC 
TCTTTAAAAT AAAATAACAT AGTAATCCTT 
AGTTTGTTCT ATACACATTA TGTCTTTTAA 
TATTGGCAAC AGTTTTCAGA TGT GGAAACT 
CATAAACTTT GGAGTCTTTC AGACCCAGGT 
AACTTCAGAG CATTATTTCT CACCTTAATT 
CTCTTCACTC CTGTCCATTG TTTCTTCAGT 
TGTAGACACA TAGTTCTCCT GGCTCTGAGA 
CACGGCCCAC TCAATTCCTA TTGAGCAATG 
TGCAATGAGT ATTCAAATGA GGTTGAAGTA 
AT ATT CAAGG AAGCTGTAGC CAT T AAAAT C 
AAAGGGTCTT CAGGCCTTTG TGTTCTGGAA 
ATGCGATTGA TGAGTCTAGA GTCTAGCTCA 
TTTTACTAAT GTGGAACATA GAACAAATTA 
ATAGAGCCTA TCATATCCAG TCTTCCTTGC 
GGTAAGTAAA TGCCTATGAT GTTTGGCTGT 
CCCATCTTTT CTCTGTAGAA GACTCAAAAA 
ACTTTTGGAA CAGAGAGCCT CCCCAGAATC 
CCCAGGTCTC TACTGACATA AAGGAAAGAT 
CTCTCTCTTG ATCCCATTCC CACTTGCATC 
CCTCCACAAA AGAGCAGTTG CAGTAACCCA 
AGAGGCGAAA AGCAGCCCTA TATAGGTTCT 
AAATATGCGA AGGCAACTTG GAAAAGCAAG 
CTCTGGACAA AATTGAGCGC CTATGTGTAC 
ACCTGCTTGT CTGGGTGATT TTGCCTTTGA 
GAACTTCCAG ACAGGAAGAA GGCAGAGAAG 
GAGGGTTCCT AGAGCAAATG GCACAATGCC 
CTAAGGTTTC AGCAAGTATC TGCTGGCTTG 
CTCTCCCACT CTCCCATCTG CGCGCTCTTA 
GAGCAGGTAT TCCTATCGTC CTTTTCCTCC 
ATTGGGCTTT GGAACCAAAT TTCCTGAGTG 
CCTAAACCTT GGTTTAGGAA ACCAGGAGCT 
AGCTAGCCTC TCCTGCCCTC GCCCACGCTG 
GCAGGCGTTA GCGCGCGGTG AGGGGAGGGG 
AGGTGGGAAG GCAAGGAGGC CGGCCCGGTG 
TTTGCTCTCC ACCTCCCAGC GCCCCCTCCG 
CGGGACGGTC CGGAGCAAGC CCACAGGCAG 
TAGCCGCTCC AGTGCTGTAC AGGAGCCGAA 
CCAGCGACAG CCAACGCCTC TTGCAGCGCG 
CTTTCCTCTT CGGTGAAGTT TTTAAAAGCT 



TTTATGCAAC TTTAAATATG TGCAGTTTAT 3121 
ATTAAAAAGA AACCCTCTGG CTGCACAATG 3181 
GGATGACCAT TTACAGTGCT CCAGGTGGAA 3241 
TTGGCAACAA CAGAGATGAA ATAAAT GGGC 3301 
CGATGGTCAC CATTTTCCAA AGGTCAGCTC 3361 
AAGACTTTAT TAATAAATCC CTCTTGAAGT 3421 
CTCAACCTGA CTAATCTGAG AGAACACCTC 3 481 
AGAACAGGAG GAGAAAGAAG AAGAAAGCTG 3541 
CAATAATTGT CTCTTGTTCT TAAAAGCAAT 3 601 
CAGAGTTCAA AATCCAAATA AAGT AT AT GG 3661 
CTGATAATGA TATAATTGTT GACAGCACCA 37 21 
CAGATTTGTA GAAGATCTTT CCTGTTTACA 37 81 
ATGACACACT AGCCTTCTGA GGGTAACTTA 38 41 
GTGAAGACAA TGTTGGTGAT GTGGAAGCAA 3901 
TTGAATGTCA GACTGCTTTT TATTCAGAGT 3961 
TTTTTTCAGG CCTCTTTGTG TCTATGTGTC 4021 
GATTTTTGCC ACCTTCCTTC ACTGTTAGTG 4 081 
GCCTATGTTA ATTCCATTCT ACCATCCTGC 4141 
CTAGTTGAAA GTTGTGGTGG GATTAAATGT 42 01 
TCTACGCATT CTACTTACAT ATGGTGAGGT 42 61 
TCAGGAAATA ATTTTTCACC TCCTCAGGTG 4321 
GGTTCATTTA TAGCCATTTC CCAAATGACA 4 381 
AATAGCAATG GACTGGAAGA CTAGTTTAGG 4441 
TGTCCTTGTT TCAGCCTGTT CATCTGTGAA 4501 
CTTTAGGTTT GAGTTACCTT CTTTGGTCAA 45 61 
GCACAAGATA AAGCTACAAC AAAGCTACAA 4 621 
GCAAAAGAGA CCCAGGAAAA TCTCGGAATG 4 681 
AGAAGTCAAG GAATTTAAAC ATAGGGAAGG 47 41 
GTTTTCTTAT AGGTTTCACG TTTACATTTT 48 01 
TGCCACCTTT ACACAGGGCT TATGGGACCT 48 61 
CATCATCCTC TACGCCCTGG CTGTCCATCA 4 921 
ATCCTTGGAT AGTTCCAGTT GTAAAGTTTA 4 981 
CGGCTGCATA CAAAGCAAAC GTTTACAGAG 50 41 
AT GGCAAGTG TTTTTAGTGT TTGTGTGTTT 5101 
GAGTCTGGAG AG TAGAAGT A CTGGTTAAAG 5161 
AGGGTAGAAA TGACTCTGAT TCTTGGGGCT 5221 
ACGAGGCCCG ATCTATCCCT AT GACGGAAT 52 81 
GTCATGGCTT GCTCCTCAGT T TGT AG GAGA 5341 
TCAGTCCTGA AAAGAACCCC TGGCAGCCAG 5 4 01 
CTCCCTCGCC CCACCCTGTT GGTTTTTTAG 54 61 
CTGGCCTCCA GGAAATCTGG AGCCCTGGCG 5521 
ATTCAGGAAG CAGGGGTCCT CCAGGGCTAG 5581 
CGCCAGCACT TGTTTCTCCA AAGCCACTAG 5 641 
AGAAAAGGAA AGGGGAGGGG AGGGAAAAGG 57 01 
GGGGCGGGAC CCGACTCGCA AACTGTTGCA 57 61 
AGATCCCGGG GAGCCAGCTT GCTGGGAGAG 5821 
AGGAGGCGAC AGAGGGAAAA AGGGCCGAGC 5881 
GGGACGCACC ACGCCAGCCC CAGCCCGGCT 5 941 
GCGGCTTCGA AGCCGCCGCC CGGAGCTGCC 6001 
GCTAAAGACT CGGAGGAAGC AAGGAAAGTG 6061 
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CCTGGTAGGA CTGACGGCTG CCTTTGTCCT CCTCCTCTCC ACCCCGCCTC CCCCCACCCT 6121 
GCCTTCCCCC CCTCCCCCGT CTTCTCTCCC GCAGCTGCCT CAGTCGGCTA CTCTCAGCCA 6181 
ACCCCCCTCA CCACCCTTCT CCCCACCCGC CCCCCCGCCC CCGTCGCCCA GCGCTGCCAG 62 41 
CCCGAGTTTG CAGAGAGGTA ACTCCCTTTG GCTGCGAGCG GGCGAGCTAG CTGCACATTG 6301 
CAAAGAAGGC TCTTAGGAGC CAGGCGACTG GGGAGCGGCT TCAGCACTGC AGCCACGACC 63 61 
CGCCTGGTTA GGCTGCACGC GGAGAGAACC CTCTGTTTTC CCCCACTCTC TCTCCACCTC 64 21 
CTCCTGCCTT CCCCACCCCG AGTGCGGAGC CAGAGATCAA AAGATGAAAA GGCAGTCAGG 64 81 
TCTTCAGTAG CCAAAAAACA AAACAAACAA AAACAAAAAA CAAGAAATAA AAGAAAAAGA 6541 
TAATAACTCA GTTCTTATTT GCACCTACTT CAGT GGAC AC TGAATTTGGA AGGTGGAGGA 6601 
TTTTGTTTTT TTCTTTTAAG ATCTGGGCAT CTTTTGAATC TACCCTTCAA GTATTAAGAG 6661 
ACAGACTGTG AGCCTAGCAG GGCAGATCTT GTCCACCGTG TGTCTTCTTC TGCACGAGAC 6721 
TTTGAGGCTG TCAGAGCGCT TTTTGCGTGG TTGCTCCCGC AAGTTTCCTT CTCTGGAGCT 67 81 
TCCCGCAGGT GGGCAGCTAG CTGCAGCGAC TACCGCATCA TCACAGCCTG TTGAACTCTT 6841 
CTGAGCAAGA GAAGGGGAGG CGGGGTAAGG GAAGTAGGTG GAAGAT T C AG CCAAGCTCAA 6 901 
GGATG 
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CA GGCCCCACAA 
TGGGACCATT TATTGAGCAA TTATTATGTG 
TTATCTCCTT TAACCCCAGC ATAGTATGTC 
ACCAGAGATG CTCAGTCACT TGCCCAAGGT 
TTTTTTTTTT TTTTTTGACA CGGAGTCTCA 
CAATCTCAGC TCACTGCAAG CTCTGCCTCC 
CTGAGTAGCT GGGACTACAG GCACCCGCCA 
CAGAGACAGG GTTTCACCGT GTTAGCCAGG 
CCTGCCTCGG CCTCCCAAAG TGATGGAATT 
TCAAGATTTG AACCCAGGTC CTCTTGGTCC 
TGCATACGCA CCTGTCCCTC TAGAGGTGCC 
GTTAGAAATG CTAGGTATAC CTGTAAAGAG 
AAGAGATGCT GGTGTCCTTC ATTCTCCAGT 
CCAGTATAGC TGCATTCTTG GCTGGGGCAT 
AAAATAAGGA GGCCCAGTTA AATTTGAATT 
AAATATGTCC CAT GCAAT AT CTTGTTGAAA 
TCCCCACCCC TACCACTAGG CCTAAGGAAT 
GAGAAGT GGA ATTAAGCAGG CTAATAGAAG 
TTGGGTGCTG GGTGCCTCCT TAAATAAGCA 
AAGTCTTTTT GGGCTGGGTG CAGTTGCTCG 
AGGCGGGAGG ATCACCTGAG CTTGGGAGTT 
CTGTCTTTAC TAAAAATACA AAAAATTCAG 
AGCTGCTCGG GAGGCTGAGG CAGGAGAATC 
AGCAGAGATC GCGCCATTGC TCTCCAGCCT 
AAAAAAAAAG TCCTTTCGAT GTGACTGTCT 
ATGCTTTTCA GATACTTCAA AGATTCCAGA 
AGGTAAACAC AACACATCCC CCTCCTTGAC 
AACCATTATT T GAT ATT AAA ACAATAGGCT 
AAGTTAGATA ACTGAGACTT AAAGGGTGTT 
GACTCCGGGG GAGGGGGCAG AGTCACCAGC 
TGTGAGAACG GCTGCAGGCA ACCCAGGCGT 
CGCGAAGAGA GGGAGAAAGT GAAGCTGGGA 
AGTTGGAGGG GGCGAGCTGG GAGCGCGCTT 
AGGAGGAGGG CTGCTTGAGG AAGTATAAGA 
TTGGGACCGG AGAAACCAGG GGAGCCCCCC 
CTTTACTGCG CCGCGCGCCC GGCCCCCACC 
CAGCCGGGTC CAGCCGGAGC CATGG 



AACCTAGATC TGCCCCAGTA TAACTAAATC 1501 
CCAAGTATTG CGCTGAGTGC TTCCAGAGCA 1561 
AGATGCTGTT TTACAGATGA GCCAACTGAG 1621 
GACATGACTG ATATGGAATA GAGTCAAGAT 1681 
CTCTGTCTCC CAGGCTGGAG TGCAGAGGCG 1741 
CAGGTTCACG CATTCTCCTG CCTCAGCCTC 1801 
CCACACCTGG CTAATTTTTT GTATTTTTAG 18 61 
ATGGTCTCGA TCTCCTGACC TCGTGATCTG 1921 
ACAGGTGTGA GCCACCGCGA CTGGCCAGAT 1981 
CAGAGGCCCC TGTTTCTCAA CTCCCTAGCA 2041 
TGCTTAAGTG TGCTCAGCAC ATGGAAGCAA 2101 
GTGTGGGAGA TGGGGGGGAG GGAAGAGAGA 2161 
CCCTGATAGG TGCCTTTGAT CCCTTCTTGA 22 21 
TCCAACTAGA ACTGCCAAAT TTAGCACATA 22 81 
TCAGATAAAC AATGAATAAT TTGTTAGTAT 2341 
TTAAAAAAAA AAAAAAAAGT CTTCCTTCCA 24 01 
AGGGTCAGGG GCTCCAAATA GAATGTGGTT 24 61 
GCAAGGGGCA AAGAAGAAAC CTTGAATGCA 2521 
AGAAGGGTGC ATTTTGAAGA AT T GAGAT AG 2581 
TGGTTGTAAT TCCAGCACTT TGGGAGGCTG 2 641 
CAAGACCAGC CTCACCAACG TGGAGAAACC 27 01 
CTGGTCATGG TGGCACATGC CTGTAATCCC 27 61 
ACTTGAACCA GGGAGGCAGA GGTTGTGGTG 2821 
GGGCAACAAG AGCAAAAGTT CGTTTAAAAA 2 8 81 
CCTCCCAAAT TTGTAGACCC TCTTAAGATC 2 941 
AGATATGCCC CGGGGGTCCT GGAAGCCACA 3001 
TATCAATTTT AC T AGAG GAT GTGGTGGGAA 3 0 61 
TGGGATGGAG TAGGATGCAA GCTCCCCAGG 3121 
AAGAGTGGCA GCCTAGGGAA ATTTATCCCG 3181 
CTCTGCATTT AGGGATTCTC C GAG GAAAAG 3241 
CCCGGCGCTA GGAGGGACGA CCCAGGCCTG 3301 
GTTGCCGACT CCCAGACTTC GTTGGAATGC 3361 
GCTCCCAATC ACCGGAGAAG GAGGAGGTGG 3421 
ATGAAGTTGT GAAGCT GAGA TTCCCCTCCA 3481 
GGGCAGCCGC GCGCCCCTTC CCACGGGGCC 3541 
CCTCGCAGCA CCCCGCGCCC CGCGCCCTCC 3 601 
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